Annotating the World Wide Web using Natural Language

Author

  • Boris Katz
Abstract

This paper describes the START Information Server built at the MIT Artificial Intelligence Laboratory. Available on the World Wide Web since December 1993, the START Server provides users with access to multi-media information in response to questions formulated in English. Over the last 3 years, the START Server answered hundreds of thousands of questions from users all over the world. The START Server is built on two foundations: the sentence-level Natural Language processing capability provided by the START Natural Language system (Katz) and the idea of natural language annotations for multi-media information segments. This paper starts with an overview of sentence-level processing in the START system and then explains how annotating information segments with collections of English sentences makes it possible to use the power of sentence-level natural language processing in the service of multi-media information access. The paper ends with a proposal to annotate the World Wide Web.

Similar resources

From Sentence Processing to Information Access on the World Wide Web

This paper describes the START Information Server built at the MIT Artificial Intelligence Laboratory. Available on the World Wide Web since December 1993, the START Server provides users with access to multi-media information in response to questions formulated in English. Over the last 3 years, the START Server answered hundreds of thousands of questions from users all over the world. The START...

Using Generalized Language Model for Question Matching

Question answering is one of the popular services on the World Wide Web. The main goal of these services is to find the best answer to a user's input question as quickly as possible. To achieve this aim, most of them use new techniques for question matching. There are many question answering services on the Persian web, so it seems that developing a question matching m...

Semantic Verification of Web Sites Using Natural Semantics

The huge amount of information and knowledge available on the Web leads to the fact that it is more and more difficult to manage this information. Two different ways are commonly explored: giving a syntactical structure to Web sites, and annotating their content to facilitate Web mining. In this paper we explore a different approach inherited from software engineering: specifying the semantics of W...

A Unifying Approach to HTML

The number, the size, and the dynamics of Internet information sources bear abundant evidence of the need for automation in information extraction. This calls for representation formalisms that match the World Wide Web reality and for learning approaches and learnability results that apply to these formalisms. The concept of elementary formal systems is appropriately generalized to allow for t...

Geographical localization of web domains and organization addresses recognition by employing natural language processing, Pattern Matching and clustering

Nowadays, the World Wide Web is growing at an increasing rate, and consequently the online resources populating the Internet represent a large source of knowledge for various business and research interests. For instance, over the past years, increasing attention has been focused on retrieving information related to the geographical location of places and entities, which is largely con...

Publication date: 1997